PCT
WORLD INTELLECTUAL PROPERTY ORGANIZATION
International Bureau

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT)

(51) International Patent Classification 6: C12N 15/10
(11) International Publication Number: WO 98/28416 A1
(43) International Publication Date: 2 July 1998 (02.07.98)

(21) International Application Number: PCT/DK97/00567

(22) International Filing Date: 15 December 1997 (15.12.97)

(30) Priority Data:
1471/96    20 December 1995 (20.12.95)    DK
0592/97    23 May 1997 (23.05.97)         DK
0935/97    14 August 1997 (14.08.97)      DK

(71) Applicant (for all designated States except US): NOVO
NORDISK A/S [DK/DK]; Novo Allé, DK-2880 Bagsværd (DK).

(72) Inventors; and
(75) Inventors/Applicants (for US only): BJØRNVAD, Mads,
Eskelund [DK/DK]; Novo Nordisk A/S, Novo Allé,
DK-2880 Bagsværd (DK). RASMUSSEN, Michael, Dolberg
[DK/DK]; Novo Nordisk A/S, Novo Allé, DK-2880
Bagsværd (DK). JØRGENSEN, Per, Linå [DK/DK]; Novo
Nordisk A/S, Novo Allé, DK-2880 Bagsværd (DK).
BORCHERT, Torben, Vedel [DK/DK]; Novo Nordisk A/S,
Novo Allé, DK-2880 Bagsværd (DK).

(74) Common Representative: NOVO NORDISK A/S; Corporate
Patents, Novo Allé, DK-2880 Bagsværd (DK).

(81) Designated States: AL, AM, AT, AU, AZ, BA, BB, BG, BR,
BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, FI, GB, GE,
GH, GM, GW, HU, ID, IL, IS, JP, KE, KG, KP, KZ,
LC, LK, LR, LS, LT, LU, LV, MD, MG, MK, MN, MW,
MX, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, SL,
TJ, TM, TR, TT, UA, UG, US, UZ, VN, YU, ZW, ARIPO
patent (GH, GM, KE, LS, MW, SD, SZ, UG, ZW), Eurasian
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European
patent (AT, BE, CH, DE, DK, ES, FI, FR, GB, GR, IE, IT,
LU, MC, NL, PT, SE), OAPI patent (BF, BJ, CF, CG, CI,
CM, GA, GN, ML, MR, NE, SN, TD, TG).

Published
With international search report.



(54) Title: IN VIVO RECOMBINATION 
(57) Abstract 



The present invention relates to a method for in vivo recombination
of homologous DNA sequences situated on at least two different
DNA structures. The method is a forced artificial evolution resulting
in a DNA sequence encoding a polypeptide having an advantageous
property.




WO 98/28416                                        PCT/DK97/00567



Title: In Vivo Recombination 

Field of the invention 

The present invention relates to a method for in vivo
recombination of homologous DNA sequences. The method is a
forced artificial evolution resulting in a DNA sequence encoding
a polypeptide having an advantageous property.

Background of the invention 

Homologous recombination between DNA sequences placed on
the same plasmid is well known to the skilled artisan (Weber and
Weissmann, Nucl. Acids Res. (1983) 11, 5661-5669). EP 252666 B1
(Novo Nordisk A/S), WO 95/22625 A1 (Affymax Technologies N.V.)
and WO 91/01087 disclose in vivo recombination of genes placed on
the same plasmid.

EP 449923 B1 (Setratech) discloses a process of intergenic
recombination in vivo of partially homologous DNA sequences.
The recombination takes place in cells in which the enzymatic
mismatch repair system is defective.

J. Biotechnol. (1991) 19, 221-240 discloses a process
for inactivating a gene in the genome of B. amyloliquefaciens.
The process involves integration and excision of a pE194 mutant.

In Chemical Abstracts 118:161925 (1993) the minimal sequence
length of homology necessary for recombination is determined.
The method of determination involves integration of a
pE194 derivative carrying a gluconase gene.

In J. Bacteriol. (1982) 152, 524-526 a plasmid pBD9,
which comprises the two plasmids from Staphylococcus aureus,
pE194 and pUB110, is disclosed. Other plasmids based on pE194 or
pUB110 are disclosed in Molecular and General Genetics (1988),
213, 465-70, Molecular and General Genetics (1984), 195, 374-7,
Plasmid (1981), 6, 67-77 and Genetika (1986), 22, 2750-7.

In Yichuan Xuebao (1993), 20, 272-8 (see Chemical Abstracts
120:1670 (1994)), Chen et al. disclose the plasmid
pNW102, a pE194 derivative, in which the thermostable α-amylase
gene from Bacillus licheniformis is inserted. pNW102 was transformed
into the Bacillus subtilis strain BF 7658, followed by
incubation at non-permissive temperature, allowing homologous
recombination between the α-amylase genes of pNW102 and BF 7658.
The α-amylase produced by the recombinant strains shows the same
characteristics as the α-amylase from Bacillus licheniformis.

However, the related art gives no indication that
DNA sequences can be recombined in vivo by repeating integration
and cross-out in order to obtain a gene encoding a
polypeptide with improved characteristics, such as increased
stability, improved specificity or higher activity.

Accordingly, a problem of the invention is to provide an
improved in vivo process of generating new DNA sequences.

Brief Description of the invention 

The present invention is directed to an in vivo process
of generating new DNA sequences. It has turned out that new DNA
sequences with improved properties can be generated in a fast
and efficient way from at least two homologous DNA sequences by
a process involving in vivo exchange of DNA between at least two
vectors, each containing a homologous DNA sequence and an origin
of replication.

The process of the invention is characterized in that
said exchange of said homologous sequences is done by repeating
in vivo, at least once, said integration and excision of one
vector into the other.

Accordingly, in a first aspect the invention relates to
a process for in vivo recombination of homologous DNA sequences,
comprising the following steps:

(a) incubating a cell containing at least two DNA structures,
each comprising a DNA sequence and an origin of
replication, under conditions which favour integration
of one of the DNA structures into one of the others,
thereby forming a hybrid DNA structure;

(b) incubating the cell under conditions which favour crossing
out from said hybrid DNA structure, thereby forming
novel DNA structures, each comprising a recombined DNA
sequence and an origin of replication;

(c) repeating steps (a)-(b) at least once.

One of the advantages of the in vivo recombination
process of the invention is that by repeating the exchange in vivo
between the DNA sequences it is possible to obtain numerous
different recombination patterns between said DNA sequences.

This is illustrated in figure 1 herein, which shows the
result of in vivo recombination between the two DNA sequences
Savinase and Savisyn performed as described in example 1.

In figure 1, Savinase/Savisyn recombined clones such as
clones no. 3, 8 and 12 are derived from more than one in vivo
recombination event between the two DNA sequences.

Brief description of the drawing 

Figure 1 illustrates the recombination of two DNA sequences,
placed on the plasmids pTR (temperature resistant, since its
origin of replication is functional at 50°C) and pTS (temperature
sensitive, since its origin of replication is not functional above
45°C). Under non-permissive conditions (i.e. at 50°C), pTS
integrates into pTR, forming a hybrid plasmid.

Figure 2 illustrates the cross out from the hybrid plasmid,
forming two new DNA sequences, each placed on a plasmid.
Repeated crossing in and out may be done by e.g. temperature
cycling or simply by maintaining the temperature pressure and
selecting for the resistance of pTS (i.e. ErmR).

Figure 3 illustrates the plasmids which are obtained after
repeated integration and crossing out events.

Further, figures 1 and 2 illustrate a DNA structure system
suitable for in vivo measurement of hybrid structure formation,
and suitable for performing a forced cross out of the DNA
hybrid structure.

Fig. 1: DNA structure pTS comprises:
    a suitable active promoter directing expression of an
    active protein
    ErmR: antibiotic resistance gene
DNA structure pTR comprises:
    no active promoter to direct transcription of GFP
    GFP: Green Fluorescent Protein

The ErmR antibiotic resistance gene in pTS is active due
to the active promoter.

The GFP gene in pTR is inactive (i.e. the gene is not
transcribed) because there is no active promoter to drive
expression of the gene.

Fig. 2: After formation of the hybrid structure according
to the invention, GFP is active (driven by the promoter).
This makes it possible to measure the hybrid structure
formation in vivo (see below for further details).

In contrast, the ErmR antibiotic resistance gene is now
inactive (i.e. the gene is not transcribed) because there is no
active promoter to drive expression of the gene. Forced cross
out of the DNA hybrid structure is performed by incubating the
cells under ErmR antibiotic pressure, thereby forcing a cross
out of the DNA hybrid structure, since the cell is only able to
express the ErmR antibiotic resistance gene after the crossing
out of the DNA structure has taken place.
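
The promoter logic described for figures 1 and 2 can be summarised as a small truth table. The Python sketch below is illustrative only: the state labels `separate` and `hybrid` and the function names are assumptions introduced here, and the model ignores copy number, temperature and everything except which marker gene is transcribed.

```python
# Which marker genes are transcribed in each configuration, following the
# description of figures 1 and 2. The state names are hypothetical labels.
EXPRESSED = {
    "separate": {"ErmR"},  # promoter on pTS drives ErmR; GFP on pTR has no promoter
    "hybrid": {"GFP"},     # promoter now drives GFP; ErmR is not transcribed
}


def is_fluorescent(state):
    """True if GFP is transcribed, i.e. the hybrid structure has formed."""
    return "GFP" in EXPRESSED[state]


def survives_erm_selection(state):
    """True if the cell can express ErmR under antibiotic pressure."""
    return "ErmR" in EXPRESSED[state]


if __name__ == "__main__":
    for state in ("separate", "hybrid"):
        print(state, is_fluorescent(state), survives_erm_selection(state))
```

Read this way, fluorescence reports hybrid formation, while antibiotic pressure selects for cells in which the hybrid has been resolved again.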

Figures 4 and 5: Illustrate the two vectors pSX120 (fig.
4) and pMB430 (fig. 5) used as described in example 1 to
recombine the two DNA sequences Savinase and Savisyn in vivo.

pSX120 comprises i) an open reading frame encoding Savinase,
ii) an active promoter directing transcription of Savinase,
and iii) a gene conferring resistance to chloramphenicol
(CamR).

pMB430 comprises i) a Savizyn DNA fragment, ii) a terminator
(resulting in no transcription through the regions following
this), iii) a gene conferring resistance to erythromycin
(ErmR), and iv) an open reading frame encoding Green Fluorescent
Protein (GFP).

Fig. 6: Illustrates the results of Savinase/Savisyn in
vivo recombination performed as described in example 1.

The schematic representation shows 12 of the
shuffled/recombined clones.

Horizontally are given the positions of the
restriction sites unique for Savizyn.

The letter "S" denotes that the sequence at this position
is identical to the Savinase sequence and the letter "Z" denotes
that the sequence stems from Savizyn. All of the sequenced
clones are different in their pattern of recombination.
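
The S/Z notation can be reproduced computationally. The following Python sketch is a simplified illustration, not the procedure used in example 1: it assumes the clone and both parents are available as equal-length, pre-aligned strings and scores single diagnostic positions rather than restriction sites. The function names and toy sequences are hypothetical.

```python
def recombination_pattern(clone, parent_s, parent_z, positions):
    """Return one 'S', 'Z' or '?' call per diagnostic position.

    'S' means the clone matches the first parent at that position,
    'Z' means it matches the second parent, '?' means neither.
    """
    calls = []
    for pos in positions:
        if parent_s[pos] == parent_z[pos]:
            raise ValueError(f"position {pos} does not distinguish the parents")
        if clone[pos] == parent_s[pos]:
            calls.append("S")
        elif clone[pos] == parent_z[pos]:
            calls.append("Z")
        else:
            calls.append("?")
    return "".join(calls)


def count_crossovers(pattern):
    """Minimum number of exchanges implied by switches between S and Z."""
    calls = [c for c in pattern if c in "SZ"]
    return sum(1 for a, b in zip(calls, calls[1:]) if a != b)


if __name__ == "__main__":
    # Toy parents that differ at every position (assumption for illustration).
    pattern = recombination_pattern("AACCAACC", "AAAAAAAA", "CCCCCCCC", [0, 2, 4, 6])
    print(pattern, count_crossovers(pattern))
```

A pattern with more than one switch between S and Z indicates that the clone arose from more than one recombination event.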

Definitions 

As used herein, the following terms have the following
meanings:

The term "DNA sequence" includes any DNA sequence which
it is desirable to modify according to the invention, e.g. a DNA
sequence encoding a polypeptide, for example an enzyme or a
pharmaceutically active polypeptide (e.g. insulin, growth hormone,
human hormones or growth regulators), and sequences such as
promoters, transcription or translation regulators and other
intracellular polynucleotides and polypeptides.

The term "homologous" means the DNA sequences have some
stretches of identical nucleotides, allowing recombination in
vivo. Said stretches within the DNA sequences are at least 15
base pairs long, more preferably at least 25 base pairs long, even
more preferably at least 50 base pairs long, and most preferably
at least 150 base pairs long.
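
To make the length criterion concrete, the sketch below finds the longest run of identical nucleotides shared by two sequences and reports which of the thresholds named above it reaches. This is a minimal illustration using a standard longest-common-substring dynamic programme; the function names are hypothetical and the sketch says nothing about whether recombination would actually occur.

```python
# Stretch lengths (base pairs) taken from the definition above.
THRESHOLDS_BP = (15, 25, 50, 150)


def longest_identical_stretch(seq_a, seq_b):
    """Length of the longest contiguous run present in both sequences."""
    best = 0
    prev = [0] * (len(seq_b) + 1)
    for a in seq_a:
        curr = [0] * (len(seq_b) + 1)
        for j, b in enumerate(seq_b, start=1):
            if a == b:
                # Extend the run that ended at the previous pair of bases.
                curr[j] = prev[j - 1] + 1
                best = max(best, curr[j])
        prev = curr
    return best


def thresholds_met(seq_a, seq_b):
    """List the thresholds reached by the longest shared stretch."""
    longest = longest_identical_stretch(seq_a, seq_b)
    return [t for t in THRESHOLDS_BP if longest >= t]


if __name__ == "__main__":
    a = "ACGT" * 5 + "TTTT"
    b = "GGGG" + "ACGT" * 5
    print(longest_identical_stretch(a, b), thresholds_met(a, b))
```

The run time grows with the product of the two sequence lengths, which is acceptable for gene-sized inputs.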

The term "homologous" comprises stretches of DNA
sequences that range from 60% identical to DNA sequences that
differ in only one nucleotide. It is preferred that the
stretches of DNA sequences are more than 70% identical, more
preferably at least 80%, especially more than 90% identical, and
more preferred that the stretches of DNA sequences are 95%
identical, most preferred at least 97%.

Within the stretches of the DNA sequences of interest,
the DNA sequence homology referred to above is determined as the
degree of identity between two sequences indicating a derivation
of the first sequence from the second. The homology may suitably
be determined by means of computer programs known in the art,
such as GAP provided in the GCG program package (Program Manual
for the Wisconsin Package, Version 8, August 1994, Genetics
Computer Group, 575 Science Drive, Madison, Wisconsin, USA
53711) (Needleman, S.B. and Wunsch, C.D., (1970), Journal of
Molecular Biology, 48, 443-453). Using GAP with the following
settings for DNA sequence comparison: GAP creation penalty of 5.0
and GAP extension penalty of 0.3, the homologous DNA sequences
of interest exhibit a degree of identity preferably as
described above.
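
As a rough illustration of the identity criterion, the sketch below computes percent identity for two sequences that have already been aligned (for instance by a program such as GAP) and reports the highest of the identity levels named above that is reached. It is not a reimplementation of GAP: the alignment itself is assumed as input, counting gap columns as mismatches is a convention chosen here, and treating every level as "at least" is a simplification. All names are hypothetical.

```python
# Identity levels (percent) named in the definition of "homologous".
IDENTITY_TIERS = (60, 70, 80, 90, 95, 97)


def percent_identity(aligned_a, aligned_b):
    """Percent of alignment columns in which both sequences carry the same base.

    Both inputs must be the same length, with '-' marking gaps. Columns where
    only one sequence has a gap count as mismatches (an assumption).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    matches = 0
    compared = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == "-" and b == "-":
            continue  # column carries no information
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


def highest_tier(identity):
    """Highest listed identity level that is reached, or None."""
    met = [t for t in IDENTITY_TIERS if identity >= t]
    return max(met) if met else None


if __name__ == "__main__":
    identity = percent_identity("ACGT-A", "ACCTTA")
    print(round(identity, 1), highest_tier(identity))
```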

The term "Recombined DNA sequence" means the DNA sequence
which is a result of a recombination event between at
least two DNA sequences. The recombination event may be a single
recombination event or may be first a recombination event
between at least two DNA sequences followed by a cross out at a
different site than the first recombination event.

The term "DNA structure" means a DNA molecule, e.g. a
vector, a plasmid, a chromosome (e.g. a Bacillus subtilis
chromosome), a virus, a phage, a transposon or a genome.

The term "vector" means any linear or circular DNA
molecule that is able to contain a DNA sequence to be recombined
and an origin of replication. The term includes any vector known
to the skilled artisan, e.g. a plasmid or a phage. The DNA
sequence is optionally, but not necessarily, under control of a
promoter.

The term "a cell" used herein in connection with an in
vivo recombination process of the invention is intended to cover
both "a cell" and "a cell population".

The term "transferring" means that the DNA structure is
introduced into a cell. The DNA structure may be introduced using
techniques known in the art for direct introduction of DNA, e.g.
by use of electroporation, transformation of competent cells,
protoplast transformation, transfection, transduction, conjugation
or ballistic transformation.

The term "incubating" (or cultivating, growing) means
that the cell is treated in such a way that the normal cell
functions are active, especially the functions necessary for
recombination. The cell can be incubated on a solid substrate
(agar Petri dish) or the cell can be incubated under submerged
conditions. It is preferred that the cell is incubated in a test
tube, e.g. an Eppendorf test tube. In the case of a
temperature-sensitive origin of replication, the test tube is
conveniently placed in an automatic thermocycler. Further, the cell
may be incubated in standard shake flasks.

The term "origin of replication" (ori) means a region
where replication of a DNA structure is initiated. The term
functional in connection with origin of replication means that
replication can be initiated from the origin. The term
non-functional means that no replication is initiated under the
given conditions, or that the initiation of replication is
reduced, e.g. to a level insufficient for survival of a cell
containing a DNA structure with a marker gene and the ori, under
appropriate selection pressure.

The term "temperature sensitive" in connection with origin
of replication or vector means the vector is unable to
replicate at e.g. increased temperatures, i.e. non-permissive
conditions, which yet permit growth of the parent cell.

The term "temperature resistant" in connection with 
origin of replication or vector means the vector is able to 
replicate at increased temperatures. 

The term "DNA library" is a library comprising numerous
different DNA sequences. A DNA library may be made according to
standard procedures known in the art, such as e.g. by in vitro
DNA shuffling/DNA recombination (WO 95/17413, WO 95/22625)
and/or error-prone PCR and cassette mutagenesis.

A DNA library herein may comprise as few as 2-20
different sequences, as e.g. a library where only one amino acid
is varied. Further, a DNA library may comprise numerous different
DNA sequences, e.g. up to 10^10 different sequences.
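
The two ends of this range can be related by simple counting. The sketch below assumes full randomisation over the 20 standard amino acids at each varied position, which is an illustrative assumption and not a statement about how any particular library is built; the function names are hypothetical.

```python
AMINO_ACIDS = 20  # standard proteinogenic amino acids


def library_size(varied_positions, residues_per_position=AMINO_ACIDS):
    """Number of distinct protein variants when each varied position is
    allowed `residues_per_position` alternatives independently."""
    return residues_per_position ** varied_positions


def max_positions(limit, residues_per_position=AMINO_ACIDS):
    """Largest number of fully varied positions whose library fits in `limit`."""
    n = 0
    while residues_per_position ** (n + 1) <= limit:
        n += 1
    return n


if __name__ == "__main__":
    print(library_size(1))         # one varied position
    print(max_positions(10 ** 10)) # positions that fit in 10^10 variants
```

Under these assumptions a single varied position gives at most 20 variants, and a library of 10^10 sequences can cover seven fully randomised positions.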

Detailed Description of the Invention

The present invention comprises a process for in vivo
recombination of homologous DNA sequences, comprising the steps

(a) incubating a cell containing at least two DNA structures,
each comprising a DNA sequence and an origin of
replication, under conditions where i) at least one origin
of replication is non-functional and ii) the cell is
under a selective pressure which only allows growth of
the cell if it contains the DNA structure which has
the non-functional origin (i.e. favouring an integration
of one of the DNA structures into one of the
others, thereby forming a hybrid DNA structure);

(b) changing said conditions to conditions where fewer
origins of replication are non-functional than in step
(a) (i.e. favouring a crossing out from said hybrid DNA
structure);

(c) repeating steps (a)-(b) at least once.
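
The effect of repeating steps (a) and (b) can be sketched with a toy simulation. The Python code below is an illustration under strong simplifying assumptions, not a model of the actual biology: the two sequences are taken as pre-aligned strings of equal length, integration and crossing out are each represented as a single exchange at a uniformly random position (real exchanges require stretches of identity), and the hybrid structure is represented only by the two chimeric copies it carries. All function names are hypothetical.

```python
import random


def crossover(seq_a, seq_b, point):
    """Exchange the tails of two equal-length sequences at `point`."""
    return seq_a[:point] + seq_b[point:], seq_b[:point] + seq_a[point:]


def one_cycle(seq_a, seq_b, rng):
    """One round of integration followed by crossing out."""
    # Step (a): integration by a single exchange forms the hybrid,
    # which carries two chimeric copies of the sequence.
    x = rng.randrange(1, len(seq_a))
    hybrid_1, hybrid_2 = crossover(seq_a, seq_b, x)
    # Step (b): crossing out by a second exchange resolves the hybrid.
    # If the second point equals the first, the parents are restored.
    y = rng.randrange(1, len(seq_a))
    return crossover(hybrid_1, hybrid_2, y)


def recombine(seq_a, seq_b, cycles, seed=0):
    """Step (c): repeat integration and crossing out `cycles` times."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be pre-aligned to equal length")
    rng = random.Random(seed)
    for _ in range(cycles):
        seq_a, seq_b = one_cycle(seq_a, seq_b, rng)
    return seq_a, seq_b


if __name__ == "__main__":
    # 'S' and 'Z' stand in for positions derived from each parent.
    print(recombine("S" * 30, "Z" * 30, cycles=5, seed=1))
```

Each cycle can add up to two switch points, so repeated cycles yield mosaic sequences that a single exchange cannot produce.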

The recombined DNA sequence will encode new polypeptides
having new properties. It is then possible to incorporate a
selection or screening system in the process of the invention in
order to select and/or screen for desired properties encoded by
one or more of the recombined DNA sequence(s).

The number of repetition(s) in step (c) above may be one
repetition or it may be more, such as 5-10 repetitions or 50-100
or even higher numbers of repetitions.

It is preferred that integration is forced (or made
favorable for survival of the DNA structure) by incubation of the
cell under such conditions where the origin of replication of a
DNA structure is non-functional, and that cross out is forced by
incubation of the cell under such conditions where the origin of
replication of a DNA structure again is functional.

In another preferred embodiment of the routine depicted
above in steps (a) to (c), the cells are
kept under constant conditions (e.g. constant temperature and
selection pressure) where spontaneous cross-in and cross-out
happen. That is, the shuffling procedure in the cells is
repeated without changing the conditions repeatedly.

In a number of DNA structures, such as in many Bacillus
DNA structures (vectors) (Bacillus subtilis and Other
Gram-Positive Bacteria, Sonenshein et al., 1993, American Society for
Microbiology, Washington D.C.), the origin of replication
comprises an ori(+) and an ori(-). ori(+) controls where replication
of the first DNA strand starts, and ori(-) controls the
initiation point for replication on the second strand (Bacillus
subtilis and Other Gram-Positive Bacteria, Sonenshein et al.,
1993, American Society for Microbiology, Washington D.C.).

In an embodiment of the process of the invention, at
least one DNA structure does not comprise ori(-) (i.e. only
comprises ori(+)). Preferably the DNA structure which does not
comprise ori(-) is a DNA structure whose origin (i.e. herein
ori(+)) is functional during the hybrid DNA structure formation
performed according to the invention.

One advantage of this embodiment of the invention is that
when only ori(+) is functional in said DNA structure, this DNA
structure is in a single-stranded state for a relatively longer
period, compared to a DNA structure comprising both ori(+) and
ori(-). This prolonged single-stranded state facilitates the
hybrid DNA structure formation according to a process of the
invention.

Preferably, the DNA structure which does not comprise
ori(-) as described above is a Bacillus structure, more
preferred a Bacillus vector.

In order to illustrate this concept, an example is a
modified pTR (TR: temperature resistant) Bacillus vector which
does not comprise ori(-) (see figure 1). If the hybrid formation
is performed at 50°C, only the origin of pTR is functional
(the origin of pTS (TS: temperature sensitive) is not functional)
(see figs. 1 and 2). The DNA structure pTR only comprises ori(+) and
is therefore in a single-stranded state for a relatively longer
period, which facilitates the recombination event and consequently
hybrid formation with the pTS DNA structure (see fig.

It is more preferred that one DNA structure is a vector
which is able to replicate under certain (permissive) conditions
and unable to replicate under other (non-permissive) conditions.
The vector may, for instance, be one which is
temperature-sensitive for replication. Thus, the vector may be one which is
unable to replicate at increased temperatures, which yet permit
growth of the parent cell. The cell is initially incubated at a
temperature permitting vector replication and subsequently, after
integration into the other DNA structure may have taken place,
incubated at a temperature which does not permit vector
replication, so that the vector is lost from the cells unless
integrated.

The vector may further comprise a selectable marker. In
this case, the incubation at the non-permissive temperature may
be conducted under selective conditions to ensure that only
cells containing the integrated vector (which includes the DNA
sequence and the selectable marker) will survive.

Further, it is preferred to construct a system where it
is possible to measure formation of the hybrid structure
in vivo.

Preferably, this is done by constructing a system 
wherein an in vivo screenable or selectable protein is expressed 
only after the hybrid structure has been formed. 

Preferably, said screenable protein is a protein which
is fluorescent under suitable exposure, and in particular said
fluorescent protein is Green Fluorescent Protein (GFP) or
variants thereof.

Green Fluorescent Protein (GFP) or variants thereof, and how
the GFPs are activated to fluoresce, are described in the art,
and reference is made to Crameri et al., Nature Biotechnology
14:315-319 (1996), and Cormack et al., GENE 173:33-38 (1996), for
further details.

One of the advantages of using such a screenable protein
which is fluorescent under suitable exposure to light, and in
particular GFP or variants thereof, is that it is then possible
by e.g. fluorescence-activated cell sorting (FACS) technology to
selectively sort out cells which comprise said hybrid structure.
This may eliminate the background of cells which do not comprise
said hybrid structure after performing step (a) according to
the first aspect of the invention.

FACS is a widely known screening technology and reference 
is made to (Cormack et al. "FACS-optimized mutants of the green 
fluorescent protein (GFP)" GENE 173:33-38 (1996)) for further 
details. 

In order to illustrate a suitable system enabling in
vivo measurement of the formation of the hybrid structure, an
example of a suitable system is shown in figures 1 and 2.

As an alternative to the use of a screenable protein as
discussed above, it may be advantageous to use a selectable
marker, wherein said selectable marker is only transcriptionally
active when the hybrid structure is formed.

Further, it is preferred that the crossing out from
the hybrid DNA structure, in step (b) of the first aspect of the
invention, is a forced crossing out event.

Forced crossing out from the hybrid structure may be
performed by:

constructing a system wherein the DNA hybrid structure
does not express a selectable marker, and at least one
of the at least two DNA structures in step (a) of the first
aspect is able to express said selectable marker;

incubating the cell under selective conditions,
thereby forcing a cross out of the DNA hybrid structure,
since the cell is only able to express the
marker gene after the crossing out of the DNA hybrid
structure has taken place.

Preferably, the selectable marker gene mentioned above is
expressed on a DNA structure where the origin of replication is
non-functional during a forced integration of the DNA structures
into a DNA hybrid structure performed as described above, and
preferably the selectable marker gene is an antibiotic
resistance gene. See below for further description of suitable
selectable marker genes.

In order to illustrate the concept of forced crossing
out from the hybrid DNA structure, an example of a suitable
system is shown in figures 1 and 2.

The selectable marker may be any marker known in the art,
for instance a gene coding for a product which confers
antibiotic resistance to the cell, which confers prototrophy to an
auxotrophic strain, or which complements a defect of the host
(e.g. dal genes introduced in a dal- strain; cf. B. Diderichsen
(1986), Bacillus: Molecular Genetics and Biotechnology
Applications, A.T. Ganesan and J.A. Hoch, Eds., Academic Press, pp.
35-46). The selectable marker may, e.g., be excised from a known
source or present on a vector, e.g. a plasmid, used for the
construction of the DNA structure to be used in the method of the
invention.

This selection may be accomplished by growing the cells
under selection pressure for the selection marker encoded by the
marker gene.

The DNA structure (in which the vector is to be
integrated) may contain a gene encoding any selectable marker,
e.g. of any type as described above in connection with the
marker optionally carried by the vector to be used in the method
of the invention. Thus, the marker gene may encode an antibiotic
resistance such as resistance to kanamycin, tetracycline,
ampicillin, erythromycin, chloramphenicol, or a resistance to
various heavy metals such as selenate, antimony or arsenate.

In a preferred embodiment, the cell used in the process
is defective or transitorily inactivated in the enzymatic
mismatch repair system or has a reduced level of mismatch repair.
In the synthesis of DNA, errors can occur and the resulting
non-complementary pairs of bases are called mismatches. A process
for the correction of errors (mismatches) in DNA exists. In E.
coli the errors are very rapidly and accurately detected by two
enzymes (MutS and MutL), enabling a third enzyme (MutU) to unwind
the two DNA strands and a fourth enzyme (MutH) to cut the
neo-synthesized strand on a sequence of DNA (GATC) itself methylated
later. For a closer explanation, reference is made to EP 449923
(Setratech), the contents of which are hereby incorporated by
reference.

In another preferred embodiment the cell used in the
process is impaired or hampered in the major recombination
enzymes such as AddAB. It has been shown that a strain impaired in
the AddAB genes shows a higher frequency of homologous
recombination. Both legitimate and illegitimate recombination
frequencies are increased, but illegitimate recombination in
particular is favoured when working with a
strain impaired or hampered in the AddAB ATP-dependent nuclease
(Meima, R; Haijema, BJ; Dijkstra, H; Haan, GJ; Venema, G; Bron,
S (1997) Role of enzymes of homologous recombination in
illegitimate plasmid recombination in Bacillus subtilis. JOURNAL OF
BACTERIOLOGY Vol. 179, No. 4, pp. 1219-1229).

In another preferred embodiment the cell used in the
process is over-expressing one of the key recombination enzymes,
recA. The recA enzyme is involved in homologous recombination by
promoting renaturation of ssDNA to form the heteroduplex molecules
necessary for the recombination event, and it initiates the exchange
of DNA strands (PROGRESS IN NUCLEIC ACID RESEARCH AND MOLECULAR
BIOLOGY Vol. 56, pp. 129-223 (1997)). The recA over-expression
can be achieved by either classical mutagenesis of the host strain
or cloning of a highly expressed recA gene.

In another preferred embodiment the cell used in the
process is subjected to UV, X-ray or DNA damaging agents to
increase recombination frequency between the two homologous genes
to be shuffled. Stress such as UV, X-ray or DNA damaging agents is
known to induce the recombination enzymes via the SOS response and
activate enzymes involved in recombination processes, such as recA
(BACILLUS SUBTILIS AND OTHER GRAM-POSITIVE BACTERIA, pp. 529-537
(1993)).

In another preferred embodiment the pTR based plasmid used
as a carrier of one of the genes to be shuffled is replaced by
either a low-copy version of pTR or another B. subtilis compatible
low-copy plasmid like pWV01 (MOLECULAR AND GENERAL GENETICS Vol.
249, No. 1, pp. 43-50 (1995)), pTA1060 (JOURNAL OF BACTERIOLOGY
Vol. 142, No. 1, pp. 315-8 (1980)) or pAMβ1 (JOURNAL OF
BACTERIOLOGY Vol. 157, No. 2, pp. 445-53 (1984)).

In a further preferred embodiment, one DNA construct
comprising one of the genes to be shuffled is situated on the
chromosome of the cell and another DNA construct, comprising
another gene of interest to be recombined, is a plasmid. When
performing an in vivo recombination process of the invention the
plasmid will be recombined with the DNA construct on the
chromosome. Said plasmid may e.g. be a temperature sensitive
plasmid.

The advantage of using a low-copy pTR, or a system where
one of the DNA constructs is situated on the chromosome as
described above, is that the hybrid DNA construct after
recombination of the two homologous genes will be the predominant
form of the genes in the cell. A high-copy background of
non-recombined pTR plasmids will inevitably reverse the gene hybrids
formed in the first round back to wildtype after the second round
of recombination and therefore prohibit any shuffling of the two
genes. When using the chromosome as a carrier of one of the genes,
no wildtype background can coexist when the pTS has integrated via
the homologous genes.

It is preferred that the cell used in the process is a cell
suitable for large scale production of the product of the
recombined DNA sequence. It is preferred that the cell is a
bacterial cell, more preferred a Bacillus cell, most preferred a
Bacillus subtilis cell.

The DNA sequences to be recombined may be isolated/cloned
from organisms containing the sequences in a manner known per se.
For instance, a suitable oligonucleotide probe may be prepared
on the basis of a first DNA sequence of interest in order to
obtain a second DNA sequence homologous to the first sequence.
Alternatively, the DNA sequence can be amplified by PCR based on
primers designed according to knowledge of conserved regions.

In different embodiments in the present context the term "a set
of homologous DNA sequences ("set A", "set B", "set C" etc.)" is
used, e.g. in relation to an embodiment of the invention relating
to a process according to the invention, wherein the homologous
DNA sequences are comprised of two or more different sets of
homologous DNA sequences ("set A", "set B", "set C" etc.), and
wherein each of said sets of homologous sequences ("set A",
"set B", "set C" etc.) is inserted in a different DNA construct
("DNA construct A", "DNA construct B", "DNA construct C" etc.)
according to the invention.

This term is used to illustrate that an in vivo
recombination process of the present invention may be used to
recombine numerous "sets" of homologous DNA sequences. Each
individual "set" of homologous DNA sequence(s) may individually
comprise from only one to numerous (e.g. a "set" comprising a
DNA library) individual homologous DNA sequence(s).

Each of said "sets" of homologous DNA sequences ("set A",
"set B", "set C" etc.) may then be inserted in a different DNA
construct ("DNA construct A", "DNA construct B", "DNA construct
C" etc.) according to the invention. Said different DNA
constructs preferably differ in important parameters,
according to the invention, such as having different origins of
replication, different selectable markers etc.

All of said DNA constructs ("DNA construct A", "DNA
construct B", "DNA construct C" etc.) may then be transferred
into a cell, according to the invention, and when a process for
in vivo recombination of the invention is performed, each of
the individual DNA constructs ("DNA construct A", "DNA construct
B", "DNA construct C" etc.) may recombine randomly
according to the invention. After a few repetitions of a
process of the invention, this will give a DNA population, in
the cell population, comprising numerous different combinations
of different recombined DNA sequences, recombined randomly
between all "sets" of homologous DNA sequences used in the
process.

Further, the term "a set of homologous DNA sequences
("set A", "set B", "set C" etc.)" is used herein to illustrate one
of the advantages of an in vivo recombination process of the
invention, which is the flexibility of the system to recombine
sets of homologous DNA sequences comprising one, a few and/or
numerous (e.g. big DNA libraries) individual homologous DNA
sequences.

Accordingly, an embodiment of the invention relates to a
process according to the invention, wherein said sets of
homologous DNA sequences each contain only one DNA sequence
(i.e. recombination of two or more homologous sequences).

Further, in embodiments of the invention the homologous
DNA sequences to be recombined are derived from a pre-made DNA
library and/or derived from individually different pre-made
libraries.

Homologous DNA derived from such DNA libraries is
inserted into a vector according to the invention (i.e. DNA
representing the DNA library is ligated (inserted) into a vector).
As described herein the process of the invention comprises at
least two DNA structures (e.g. vectors), which are different in
a way which allows forced in vivo recombination of said vectors.
Homologous DNA representing DNA libraries may be
inserted into only one and/or both of said different vectors.

I.e. if for instance homologous DNA representing one DNA
library is inserted (ligated) into one of said different vectors
and homologous DNA representing another DNA library is inserted
into the other vector, this will allow fast and efficient
recombination between two different libraries by an in vivo
recombination process according to the invention. To illustrate
this concept, DNA representing one library may be inserted into
a pTR vector (see figure 1) and DNA representing another library
may be inserted into a pTS vector (see figure 1). Just a few
repeated in vivo recombination events performed by a process
according to the invention will then give a cell population
representing a new library which is a combination of the two
libraries mentioned above.

Accordingly, embodiments of the invention relate to a
process according to the invention, wherein at least one of the
sets of homologous DNA sequences is derived from a pre-made DNA
library, and at least one of said sets of homologous DNA
sequences is a DNA sequence which is homologous to said DNA
library (i.e. resulting in a process for in vivo recombination
of said DNA library to a DNA sequence which is homologous to
said DNA library); or

a process according to the invention, wherein the sets
of homologous DNA sequences are derived from the same pre-made
DNA library (i.e. resulting in a process for in vivo
recombination of said DNA library); or

a process according to the invention, wherein the sets
of homologous DNA sequences are derived from at least two
different pre-made DNA libraries (i.e. resulting in a process
for in vivo recombination of at least two different DNA
libraries).

In order to obtain even further recombination between the
DNA sequences it may be preferred to perform an isolation of the
DNA structures after a first round of in vivo recombination of
said DNA sequences performed according to the invention. Said
isolated structures are then re-transformed into the cells and a
further process round of in vivo recombination is then
performed.

The advantage of this step is illustrated by the following
example:

Set A is a vector A wherein DNA sequences a, b are
introduced individually in said vector A (i.e. the vector A
population comprises a vector with DNA sequence a, and a vector
with sequence b); and

Set B is a vector B wherein DNA sequences c, d are
introduced individually in said vector B (i.e. the vector B
population comprises a vector with DNA sequence c, and a vector
with sequence d).

Vector A and B are then transformed into a cell
population and in vivo recombined according to the invention.
This will give recombination between DNA sequences ac, ad, bc,
bd. The vectors A and B comprising said recombined sequences are
isolated and re-transformed into said cell population. In vivo
recombination according to the invention will then give all
possible recombinations of said four (a, b, c, d) DNA sequences,
such as recombination of abc, abdc, etc.
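
The bookkeeping in this example can be sketched in a few lines of Python. This is an illustrative model only: the function names, and the representation of a hybrid as the set of parent sequences that contributed to it, are assumptions made for the sketch and are not part of the described process. It ignores where cross-overs occur and only tracks which parents can end up combined.

```python
from itertools import product

def first_round(set_a, set_b):
    """Pair every sequence carried on vector A with every sequence on vector B."""
    return [a + b for a, b in product(set_a, set_b)]

def next_round(hybrids):
    """Model one further round after re-transformation (simplifying assumption):
    any two hybrids may meet in a cell, so a new hybrid can carry parts of
    all parents of both."""
    return {frozenset(x) | frozenset(y) for x, y in product(hybrids, repeat=2)}

pairs = first_round(["a", "b"], ["c", "d"])
print(pairs)  # ['ac', 'ad', 'bc', 'bd']

second = next_round(pairs)
# Parent combinations reachable after the second round, e.g. 'abc' and 'abcd'
print(sorted("".join(sorted(s)) for s in second))
```

With two libraries in place of the two-member sets, the number of first-round pairings in this model is simply the product of the two library sizes.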

In even further embodiments the invention relates to a
process according to the invention, wherein two sets of homologous
DNA sequences ("set A" and "set B") are recombined in vivo,
comprising the steps

a) inserting sequence(s) from "set A" in a vector containing
   a temperature resistant origin of replication and a
   gene for a first marker;

b) inserting sequence(s) from "set B" in a vector containing
   a temperature sensitive origin of replication and a
   gene for a second marker;

c) transferring the vectors into a cell;

d) incubating the cell at a temperature where both origins
   of replication are functional, optionally under a selection
   pressure that favours cells with both marker genes;

e) shifting the temperature to a temperature where only
   the temperature resistant origin of replication is functional;

f) repeating steps d)-e) at least once;



or 



a process according to the invention, wherein two sets
of homologous DNA sequences ("set A" and "set B") are recombined
in vivo, comprising the steps

a) inserting sequence(s) from "set A" in a vector containing
   a temperature resistant origin of replication and a
   gene for a first marker;

b) inserting sequence(s) from "set B" in a vector containing
   a temperature sensitive origin of replication and a
   gene for a second marker;

c) transferring the vectors into a cell;

d) incubating the cell at a temperature where only the
   temperature resistant origin of replication is functional,
   optionally under a selection pressure that favours cells
   with both marker genes;

e) optionally, shifting the temperature to a temperature
   where both origins of replication are functional;

f) repeating steps d)-e) at least once.
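
As a planning aid, steps d) to f) of the first process variant above can be written out as an ordered incubation schedule. The sketch below is hypothetical: the function name and tuple layout are invented for illustration, and the default temperatures (33°C permissive, 47°C restrictive) are taken from the growth and selection conditions reported in the examples of this document for pE194 derivatives, not prescribed by the process steps themselves.

```python
def shuffle_schedule(cycles, permissive_c=33, restrictive_c=47):
    """Return the ordered incubation steps for `cycles` passes of d)-e).

    Step f) requires d)-e) to be repeated at least once, i.e. at least
    two passes in total.
    """
    if cycles < 2:
        raise ValueError("steps d)-e) must be repeated at least once")
    steps = []
    for n in range(1, cycles + 1):
        # d) both origins of replication functional
        steps.append((n, "d", permissive_c, "both origins functional"))
        # e) only the temperature resistant origin functional
        steps.append((n, "e", restrictive_c, "only temperature resistant origin functional"))
    return steps

for cycle, step, temp_c, state in shuffle_schedule(2):
    print(f"cycle {cycle} step {step}): {temp_c} C, {state}")
```

The second variant differs only in starting at the restrictive temperature and making the shift back to the permissive temperature optional.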

15 

In a further aspect the invention relates to a method for
producing one or more recombinant protein(s) having a desired
biological activity comprising:

a) performing a process for in vivo recombination of homologous
   DNA sequences (the process is performed according
   to the invention as described herein), wherein at least
   one of the DNA structure(s) is an expression vector;

b) expressing the numerous different recombinant polypeptides
   encoded by the numerous different recombined DNA sequences
   from step a); and

c) screening or selecting the numerous different recombinant
   proteins from step b) in a suitable screening or selection
   system for one or more recombinant polypeptide(s) having
   a desired activity.

Preferably, the expression vector in step a) above is a DNA
structure which is unable to replicate (e.g. having a
non-functional origin of replication) during a forced hybrid DNA
structure formation (i.e. integration of said structures into
each other) performed according to a process for in vivo
recombination of homologous DNA sequences according to the
invention. This will improve the probability that the expression
vector, after a crossing-out event of said hybrid structure
according to the invention, comprises recombined homologous DNA,
since the only way said expression vector may have survived the
process of forced hybrid formation is by actually recombining with
another DNA structure which is able to replicate under the
conditions used to force said hybrid structure formation.

After the process for in vivo recombination has been
performed (step a) above), and before the expression and
screening steps b) and c) above, it may be advantageous to
make a cell population mainly comprising only the expression
vector (i.e. other DNA construct(s) are partly or nearly
completely removed from the cell population). This is particularly
advantageous when the expression vector is unable to replicate
during the forced hybrid DNA structure formation as described
immediately above, since such a cell population will have a very
limited background of non-recombined DNA sequences of interest,
which is advantageous in the subsequent screening step (step
c) above).

Making a cell population mainly comprising only the
expression vector may be performed according to standard procedures
known in the art, such as growing the cell population under
conditions selectively favouring replication of the expression
vector, as compared to the other DNA structures; or selectively
purifying said expression vector from the cells and subsequently
transferring it into a new cell population.

Expression of the recombinant polypeptides encoded by the
recombined sequences from step a) above may be performed by use of
standard expression vectors, and/or a standard expression vector
may be modified in order to comprise elements specially suitable
for an in vivo recombination process according to the invention.

A suitable screening or selection system, to screen or select
the numerous different recombinant proteins from step b)
above, will depend on the desired biological activity.

A number of suitable screening or selection systems to
screen or select for a desired biological activity are described
in the art. Examples are:

Strausberg et al. (Biotechnology 13: 669-673 (1995)), which
describe a screening system to screen for Subtilisin variants
having a Calcium-independent stability;

Bryan et al. (Proteins 1:326-334 (1986)), which describe a
screening assay to screen for proteases having an enhanced thermal
stability; and

PCT/DK96/00322, which describes a screening assay to screen
for lipases having an improved wash performance in washing
detergents.

A preferred embodiment of the invention comprises screening
or selection of recombinant protein(s), wherein the desired
biological activity is improved performance in dish-washing or
laundry detergents. Examples of suitable dish-wash or laundry
detergents are disclosed in PCT/DK96/00322 and WO 95/30011.

The invention also comprises a set (at least two) of
vectors, comprising at least two different origins of replication
able to function in the same cell under permissive conditions,
and at least two homologous DNA sequences encoding
polypeptides. In a preferred embodiment, at least one origin of
replication is temperature sensitive. The vectors may preferably
be plasmids.

The invention further comprises a vector system, comprising
at least

a) a vector A containing a DNA sequence encoding a polypeptide
   and an origin of replication, and

b) a vector B containing a DNA sequence homologous to the
   DNA sequence in vector A, and an origin of replication
   that differs from the origin of replication in vector A.

The vectors may preferably be plasmids.

It is preferred that the DNA sequence encodes polypeptides
with a biological activity, especially enzymes and in particular
enzymes such as an amylase, lipase, cutinase, cellulase,
oxidase, phytase, and a protease.

The invention further comprises a new, recombined DNA
sequence, characterised in that it has been produced by a process
according to the invention, or in that it encodes a recombinant
protein having a desired biological activity, wherein said
recombinant protein having a desired biological activity is
produced by the method for producing one or more recombinant
protein(s) having a desired biological activity according to the
invention.

The invention further comprises a process for producing
a polypeptide, especially an enzyme, comprising i) growing a
cell expressing a polypeptide encoded by a recombined DNA
sequence according to claim 38, and ii) purifying said expressed
polypeptide.

The invention further comprises a new polypeptide,
especially an enzyme, characterised in that it is produced by a
process according to the invention.

The invention further comprises a polypeptide,
especially an enzyme, characterised in that it is encoded by a
recombined DNA sequence according to the invention.

Polypeptides of the present invention include peptides and
proteins, and optionally glycosylated forms thereof. Examples are
enzymes and pharmaceutically active polypeptides such as analogues
of human or mammalian polypeptides, e.g. insulin, growth hormone,
human hormones or growth regulators.

The invention is described in further detail in the 
following examples which are not in any way intended to limit the 
scope of the invention. 

EXAMPLES

MATERIALS AND METHODS 

Strains

E. coli SJ2 (Diderichsen, B., Wedsted, U., Hedegaard, L.,
Jensen, B. R., Sjoholm, C. (1990) Cloning of aldB, which encodes
alpha-acetolactate decarboxylase, an exoenzyme from Bacillus
brevis. J. Bacteriol., 172, 4315-4321). Electrocompetent cells
were prepared and transformed using a Bio-Rad GenePulser(TM) as
recommended by the manufacturer.

B. subtilis DN1885 (Diderichsen, B., Wedsted, U., Hedegaard, L.,
Jensen, B. R., Sjoholm, C. (1990) Cloning of aldB, which encodes
alpha-acetolactate decarboxylase, an exoenzyme from Bacillus
brevis. J. Bacteriol., 172, 4315-4321). Competent cells were
prepared and transformed as described by Yasbin, R.E., Wilson, G.A.
and Young, F.E. (1975) Transformation and transfection in
lysogenic strains of Bacillus subtilis: evidence for selective
induction of prophage in competent cells. J. Bacteriol., 121:296-304.

B. subtilis PL1801. This strain is B. subtilis DN1885 with
disrupted apr and npr genes (Diderichsen, B., Wedsted, U.,
Hedegaard, L., Jensen, B. R., Sjoholm, C. (1990) Cloning of
aldB, which encodes alpha-acetolactate decarboxylase, an
exoenzyme from Bacillus brevis. J. Bacteriol., 172, 4315-4321).

Plasmids:

pSX120: B. subtilis expression vector (described in EP 405901
and WO 91/00345). This vector comprises the wild-type Savinase
gene.

pE194: (Horinouchi, S. and Weisblum, B., 1982, J. Bacteriol.
150:804-814).

pSX222: B. subtilis expression vector (described in WO 96/34946
and WO 92/19729). This vector comprises the synthetic Savinase
gene herein called Savizyn. In the Savizyn gene a large number
of restriction sites have been introduced as compared to the
wild-type Savinase gene comprised in pSX120.

pUC19: Yanisch-Perron, C., Vieira, J. and Messing, J. (1985)
Gene 33, 103-119.

pF64L-S65T-GFP: the E. coli plasmid carrying the DNA fragment
encoding the F64L-S65T-GFP described in WO 97/11094.

pMUTIN-4-MCS: This plasmid can be obtained from Laboratoire de
Génétique Microbienne, Institut National de la Recherche
Agronomique, 78352 Jouy en Josas CEDEX, France.

Solutions/Media

TY and LB agar (as described in Ausubel, F. M. et al.
(eds.) "Current protocols in Molecular Biology". John Wiley and
Sons, 1995).

LBPG is LB agar supplemented with 0.5% Glucose and 0.05 M
potassium phosphate, pH 7.0.

LBPGSK plates are LBPG media with agar and 1% of skimmed
milk.

Antibiotics are used in the different media in concentrations
of 6 µg/ml Chloramphenicol, 10 µg/ml Kanamycin and 5 µg/ml
Erythromycin.

TE-buffer (as described in Ausubel, F. M. et al. (eds.)
"Current protocols in Molecular Biology". John Wiley and Sons,
1995).

General molecular biology methods

DNA manipulations and transformations were performed using
standard methods of molecular biology (Sambrook et al. (1989)
Molecular cloning: A laboratory manual, Cold Spring Harbor lab.,
Cold Spring Harbor, NY; Ausubel, F. M. et al. (eds.) "Current
protocols in Molecular Biology". John Wiley and Sons, 1995;
Harwood, C. R., and Cutting, S. M. (eds.) "Molecular Biological
Methods for Bacillus". John Wiley and Sons, 1990).

Enzymes for DNA manipulations were used according to the
specifications of the suppliers.

Enzymes for DNA manipulations

Unless otherwise mentioned all enzymes for DNA
manipulations, such as e.g. restriction endonucleases, ligases,
polymerases, PCR polymerases etc., were obtained from New England
Biolabs, Inc.

Construction of the temperature sensitive shuffling plasmid

pMB430 was constructed in a five-step cloning process as
depicted in the following; the entire DNA sequence is found in
SEQ ID No. 1.



1) 






The temperature sensitive plasmid pE194 was modified by
introducing multiple cloning sites around the unique ClaI site of
the pE194 plasmid, thus destroying the ClaI site. This introduction
was done by PCR using the primers #22899 and #24039 to amplify
the whole plasmid and introduce several unique cloning
sites.

The PCR was performed as follows:

1 µl of pE194 Qiaquick plasmid prep was used for a 50 µl
PCR amplification in PCR buffer (10 mM Tris-HCl, pH 8.3, 50 mM
KCl, 1.5 mM MgCl2, 0.01 % (w/v) gelatin) containing 200 µM of
each dNTP, 2.5 units of AmpliTaq polymerase (Perkin-Elmer,
Cetus, USA) and 100 pmol of each primer:

#22899
5'-GCAGCTGGATCCGCGGCCGCGAATTCGTCTTTTGCGCAGTCGGC-3'

#24039
5'-GCAGCTGGATCCGGGCCCGGGAAGCTTCGATTCACAAAAAATAGGCACACG-3'
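
The cloning sites carried by the two primers can be checked with a simple string scan. The sketch below is illustrative: the helper name `find_sites` is hypothetical, the site table is limited to a handful of well-known palindromic recognition sequences chosen for this check, and the primer sequences are transcribed from the text above, so any transcription error would carry over.

```python
# Recognition sequences of some common restriction enzymes (all palindromic,
# so scanning one strand is sufficient).
SITES = {
    "BamHI": "GGATCC",
    "NotI": "GCGGCCGC",
    "EcoRI": "GAATTC",
    "ApaI": "GGGCCC",
    "SmaI": "CCCGGG",
    "HindIII": "AAGCTT",
    "ClaI": "ATCGAT",
}

PRIMERS = {
    "#22899": "GCAGCTGGATCCGCGGCCGCGAATTCGTCTTTTGCGCAGTCGGC",
    "#24039": "GCAGCTGGATCCGGGCCCGGGAAGCTTCGATTCACAAAAAATAGGCACACG",
}

def find_sites(seq):
    """Return {enzyme: [0-based start positions]} for every site found in seq."""
    hits = {}
    for enzyme, site in SITES.items():
        positions = [i for i in range(len(seq) - len(site) + 1)
                     if seq.startswith(site, i)]
        if positions:
            hits[enzyme] = positions
    return hits

for name, seq in PRIMERS.items():
    print(name, find_sites(seq))
```

In this scan both primers carry a BamHI site near the 5' end, consistent with the BamHI subcloning step that follows, and neither contains an intact ClaI site.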

The PCR reactions were performed using a DNA thermal
cycler (Landgraf, Germany). One incubation at 94°C for 1 min was
followed by forty cycles of PCR performed using a cycle profile
of denaturation at 94°C for 1 min, annealing at 55°C for 1 min,
and extension at 72°C for 2 min. Five-µl aliquots of the
amplification product were analyzed by electrophoresis in 0.7 %
agarose gels (NuSieve, FMC).
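
For orientation, the quantities implied by this reaction set-up and cycling profile can be worked out directly. The following is a small arithmetic sketch; the variable and function names are chosen here for illustration, and the programme time ignores temperature ramping, which depends on the instrument.

```python
REACTION_VOLUME_UL = 50.0   # total reaction volume, µl
DNTP_UM = 200.0             # concentration of each dNTP, µM
PRIMER_PMOL = 100.0         # amount of each primer, pmol

# µM x µl gives pmol; divide by 1000 to get nmol of each dNTP per reaction
dntp_nmol_each = DNTP_UM * REACTION_VOLUME_UL / 1000.0

# pmol per µl is numerically equal to µM
primer_um = PRIMER_PMOL / REACTION_VOLUME_UL

def program_minutes(initial=1, cycles=40, denature=1, anneal=1, extend=2):
    """Total hold time in minutes: initial incubation plus all cycles."""
    return initial + cycles * (denature + anneal + extend)

print(dntp_nmol_each)     # 10.0 nmol of each dNTP
print(primer_um)          # 2.0 µM of each primer
print(program_minutes())  # 161 minutes
```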

Subcloning of PCR fragment

Forty-five-µl aliquots of the PCR products generated as
described above were purified using the QIAquick PCR purification
kit (Qiagen, USA) according to the manufacturer's instructions.
The purified DNA was eluted in 50 µl of 10 mM Tris-HCl, pH 8.5.
Fifty µl of the purified PCR fragment was digested with BamHI
and electrophoresed in 0.3 % low gelling temperature agarose
(SeaPlaque GTG, FMC) gels; the relevant fragment was excised
from the gels and purified using the QIAquick Gel Extraction Kit
(Qiagen, USA) according to the manufacturer's instructions. The
isolated DNA fragment was then ligated to BamHI digested pUC19
and the ligation mixture was used to transform E. coli SJ2.






Cells were plated on LB agar plates containing Erythromycin
(200 µg/ml) supplemented with X-gal (SIGMA, USA)
(5-Bromo-4-chloro-3-indolyl beta-D-galactopyranoside, 50 µg/ml).

Identification and characterisation of positive clones

The transformed cells were plated on LB agar plates
containing Erythromycin (200 µg/ml) supplemented with X-gal (50
µg/ml) and incubated at 37°C overnight. The next day white colonies
were rescued by restreaking them onto fresh LB-Erythromycin
agar plates, which were incubated at 37°C overnight. The next day
single colonies of each clone were transferred to liquid LB
medium containing Erythromycin (200 µg/ml) and incubated
overnight at 37°C with shaking at 250 rpm.

Plasmids were extracted from the liquid cultures using the
QIAgen Plasmid Purification mini kit (Qiagen, USA) according to
the manufacturer's instructions. Five-µl samples of the plasmids
were digested with BamHI. The digestions were checked by
gel electrophoresis on a 0.7 % agarose gel (NuSieve, FMC). The
appearance of two DNA fragments of about 2.7 kb and 3.7 kb
indicated a positive clone. The clone was designated MB293 and
the plasmid pMB293.

2) 

pMB293 was then EcoRI digested and the 3.8 kb fragment was
gel-purified on a 0.7 % agarose gel using the QiaQuick gel
purification kit from Qiagen according to the manufacturer's
instructions. This fragment, which constitutes pE194 with inserted
multiple cloning sites (originating from the PCR primers and from
the MCS of pUC19), was circularised by ligation and used to
transform B. subtilis DN1885. Transformed cells were selected on
LBPG plates containing 5 µg/ml Erythromycin, and incubation was
done at 33°C overnight. Clones from this transformation were
restreaked and cultivated overnight in TY with 5 µg/ml
Erythromycin. Cells were isolated from the overnight culture broth
and plasmids were purified using the Qiaquick plasmid kit.
Restriction enzyme digestion of the plasmids verified the
correctness of the clones. One such clone and plasmid were termed
MB333 and pMB333.

3)

pMB333 was then used as the temperature sensitive plasmid
for the construction of a plasmid containing the Savizyn
gene without any promoter. This construct was made as follows.

PCR using the primers #21547 and #21548 with pSX222 as
the template was performed under these conditions:

1 µl of pSX222 Qiaquick plasmid prep was used for a 50 µl
PCR amplification in PCR buffer (10 mM Tris-HCl, pH 8.3, 50 mM
KCl, 1.5 mM MgCl2, 0.01 % (w/v) gelatin) containing 200 µM of
each dNTP, 2.5 units of AmpliTaq polymerase (Perkin-Elmer,
Cetus, USA) and 100 pmol of each primer:

#21547
5'-GCAGCTGCAGGATCCGAATTCGCGGCCGCGCAATCGGTACCATGG-3'

#21548
5'-GCAGCGAGCTCAAGCTTCCCGGGCCCAGCCGGTCGACCGCGTTGCCGCTTCTGCG-3'

The PCR reactions were performed using a DNA thermal
cycler (Landgraf, Germany). One incubation at 94°C for 1 min was
followed by forty cycles of PCR performed using a cycle profile
of denaturation at 94°C for 1 min, annealing at 55°C for 1 min,
and extension at 72°C for 2 min. Five-µl aliquots of the
amplification product were analyzed by electrophoresis in 0.7 %
agarose gels (NuSieve, FMC).

Subcloning of PCR fragment

Forty-five-µl aliquots of the PCR products generated as
described above were purified using the QIAquick PCR purification
kit (Qiagen, USA) according to the manufacturer's instructions.
The purified DNA was eluted in 50 µl of 10 mM Tris-HCl, pH 8.5.
Fifty µl of the purified PCR fragment was digested with BamHI
and ApaI and electrophoresed in 0.8 % low gelling temperature
agarose (SeaPlaque GTG, FMC) gels; the relevant fragment was
excised from the gels and purified using the QIAquick Gel
Extraction Kit (Qiagen, USA) according to the manufacturer's
instructions. The purified fragment was ligated to gel-purified
BamHI and ApaI digested pMB333. The ligation was used to transform
B. subtilis DN1885. Transformed cells were plated on LBPG agar
plates containing 5 µg/ml Erythromycin and incubated overnight
at 33°C. Clones were restreaked on similar plates and plasmids
were isolated from overnight cultures of clones incubated at
33°C in TY with 5 µg/ml Erythromycin. Plasmids were isolated
using the Qiaquick plasmid kit as described by the manufacturer.
Plasmids were checked by enzymatic digestion using ApaI and
BamHI. One correct clone was termed MB339 and the plasmid
pMB339.

4)

We wanted to test an easy way of monitoring that hybrid plasmids
had formed, i.e. to monitor the forced recombination
between the two homologous genes on the two separate plasmids.
This was done by constructing a pE194 derivative having a
promoter-lacking Savizyn gene transcriptionally fused to an ORF
encoding GFP. In this way there would be no expression of GFP (due
to the lack of a promoter upstream of the Savizyn and thus the GFP
gene) unless the two homologous genes had recombined, thereby
bringing GFP under control of the promoter otherwise present on
pSX120. The plasmid was constructed by PCR amplifying the GFP
gene and introducing it just downstream of the Savizyn gene on
pMB339.

The GFP encoding DNA used herein is a derivative of
the original wildtype gene; the derivative of GFP was cloned
from the DNA construct of the mutant F64L-S65T-GFP, which was
constructed as described in international patent application WO
97/11094.

The construction of the template plasmid is described
in WO 97/11094. The DNA fragment encoding the F64L-S65T-GFP
was PCR amplified from the E. coli plasmid encoding this gene
as follows:

1 µl of pF64L-S65T-GFP Qiaquick plasmid prep was used
for a 50 µl PCR amplification in PCR buffer (10 mM Tris-HCl, pH
8.3, 50 mM KCl, 1.5 mM MgCl2, 0.01 % (w/v) gelatin) containing
200 µM of each dNTP, 2.5 units of AmpliTaq polymerase
(Perkin-Elmer, Cetus, USA) and 100 pmol of each primer:

#20231
5'-AACTGCAGAAGATGTGGACGCGC-3'

#101381
5'-GCCCAATGCATAAACTGCATCCCTTAACTTGTTTTATTTGTATAGTTCATCCATGCCATG-3'

The PCR reactions were performed using a DNA thermal
cycler (Landgraf, Germany). One incubation at 94°C for 1 min was
followed by forty cycles of PCR performed using a cycle profile
of denaturation at 94°C for 1 min, annealing at 55°C for 1 min,
and extension at 72°C for 2 min. Five-µl aliquots of the
amplification product were analyzed by electrophoresis in 0.7 %
agarose gels (NuSieve, FMC).

Subcloning of PCR fragment

Forty-five-µl aliquots of the PCR products generated as
described above were purified using the QIAquick PCR purification
kit (Qiagen, USA) according to the manufacturer's instructions.
The purified DNA was eluted in 50 µl of 10 mM Tris-HCl, pH 8.5.
The PCR fragment and pMB339 were digested with ClaI and NsiI,
gel-purified as above and ligated together. The ligation mixture
was used to transform DN1885, and transformed cells were plated on
LBPG with 5 µg/ml Erythromycin. After 18 hours of incubation green
fluorescent cells were visible as single colonies. One of these
was analysed and saved as MB406 and the corresponding plasmid as
pMB406. Since the GFP encoding ORF did not have an upstream
promoter driving the transcription, a possible explanation for the
GFP expression could be read-through from a promoter further
upstream of the Savizyn-GFP transcriptional fusion, namely the
promoter of the repF gene.

5)

In order to abolish any read-through into the ORF of GFP, a
transcriptional terminator was inserted between the repF gene and
the Savizyn DNA. The terminator was amplified from the pMUTIN-4-MCS
plasmid using the following primers and conditions:

1 µl of pMUTIN-4-MCS Qiaquick plasmid prep was used for a 50 µl
PCR amplification in PCR buffer (10 mM Tris-HCl, pH 8.3, 50 mM
KCl, 1.5 mM MgCl2, 0.01 % (w/v) gelatin) containing 200 µM of
each dNTP, 2.5 units of AmpliTaq polymerase (Perkin-Elmer,
Cetus, USA) and 100 pmol of each primer:

#102588
5'-CGGACGGTACCGGTAATGACTCTCTAGCTTGAGGC-3'

#102589
5'-GGGTACAGATCTCCGCGGCGCAAAAAGGCCATCCGTCAGG-3'

The PCR reactions were performed using a DNA thermal cycler
(Landgraf, Germany). One incubation at 94°C for 60 sec was followed
by forty cycles of PCR performed using a cycle profile of
denaturation at 94°C for 30 sec, annealing at 55°C for 30 min, and
extension at 72°C for 1 min. Five-µl aliquots of the
amplification product were analyzed by electrophoresis in 0.7 %
agarose gels (NuSieve, FMC).

The amplified DNA was purified as above and cloned as a KpnI-BglII
fragment in pMB406, also BglII and KpnI digested (same procedure
for gel purification etc. as described above). The ligation mixture
was used to transform DN1885, and transformed cells were plated
on LBPG with 5 µg/ml Erythromycin. After 18 hours of incubation
cells were visible as single colonies. One of these was analysed
and saved as MB430 and correspondingly the plasmid as pMB430. This
clone had absolutely no expression of GFP, indicating that the
cloned terminator in fact worked as anticipated.

The resulting sequence of the entire plasmid pMB430 is listed as
SEQ ID No. 1.

25 

Establishing the MB4 3 2 strain 

Bacillus subtilis PL1801 was transformed with the two plasmids
pMB430 and pSX120 using approximately 1 µg of each plasmid per
100 µl of competent PL1801. The transformed cells were plated onto
LBPG with 5 µg/ml erythromycin and incubated overnight at 33°C.
The next day colonies were restreaked onto LBPG with 5 µg/ml
erythromycin + 6 µg/ml chloramphenicol and incubated overnight at
33°C. Clones were also grown in liquid culture (TY with 5 µg/ml
erythromycin + 6 µg/ml chloramphenicol) overnight at 33°C,
plasmids were purified from the culture broth, and the presence of
both pMB430 and pSX120 in the PL1801 strain was verified. This
clone was saved and termed MB432.

EXAMPLE 1

Sequence shuffling of two homologous sequences both coding for 
the subtilisin gene of Savinase 

The Savinase wt gene (comprised in pSX120), coding for a
protease of the subtilisin type, was chosen as the target for
shuffling of highly homologous genes. A site-directed variant of
the Savinase gene (hereafter called Savizyn, comprised in
pSX222), in which a large number of silent mutations giving new
restriction sites have been introduced, was used as the shuffling
partner to Savinase. The two genes were cloned on two separate
compatible plasmids according to the invention. A transcriptionally
active Savinase gene was cloned on a plasmid based on the pUB110
origin coding for resistance to chloramphenicol (pSX120, fig. 4).
A promoterless Savizyn gene was cloned on a derivative of the
temperature sensitive plasmid pE194, which also codes for
resistance to erythromycin (pMB430). Two transcription terminators
were cloned upstream of the Savizyn gene to prevent any
read-through.

In pMB430 the green fluorescent protein (GFP) was
transcriptionally fused to the Savizyn gene in order to be able to
visually follow the in-cross and out-cross events between the
two plasmids.

B. subtilis strain MB432 (PL1801 transformed with the two
plasmids pSX120 and pMB430) was taken through two cycles of
temperature shifts to induce two consecutive double cross-over
events between the homologous genes. Homologous recombination
and plasmid hybrid formation were selected on LBPG plates at the
restrictive temperature (47-50°C) in the presence of
erythromycin. Under these restrictive conditions, the temperature
sensitive plasmid pMB430 is forced into the stable pSX120 plasmid
by homologous recombination in the region of Savinase/Savizyn.
Colonies in which recombination has occurred also become
fluorescent as a result of the recombination between the two
plasmids and concomitant activation of the Savizyn-GFP operon
(fig. 2). Out-cross events were screened by growing the clones
in LB-broth at the permissive temperature of 30°C for two days
and plating the cells on LBPG plates with chloramphenicol and
erythromycin. The non-fluorescent colonies were restreaked on
plates and taken through a second round of temperature cycling as
described above.
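
The in-cross/out-cross cycle can be pictured with a toy model. The sketch below assumes two pre-aligned sequences of equal length and treats one round as a reciprocal exchange of the stretch between two crossover points. The function name `double_crossover`, the placeholder sequences and the crossover positions are hypothetical; the sketch ignores selection, plasmid replication and every other biological detail of the example.

```python
def double_crossover(seq_a, seq_b, start, end):
    """Reciprocally exchange positions [start, end) between two aligned sequences.

    One crossover at `start` (in-cross) and one at `end` (out-cross) leave each
    sequence carrying the partner's stretch between the two points.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned and of equal length")
    if not 0 <= start <= end <= len(seq_a):
        raise ValueError("crossover points out of range")
    new_a = seq_a[:start] + seq_b[start:end] + seq_a[end:]
    new_b = seq_b[:start] + seq_a[start:end] + seq_b[end:]
    return new_a, new_b


# Placeholder "genes": A marks Savinase-derived positions, B Savizyn-derived ones.
gene_a = "A" * 12
gene_b = "B" * 12

# Round 1 and round 2 use arbitrary, hypothetical crossover points.
a1, b1 = double_crossover(gene_a, gene_b, 2, 8)
a2, b2 = double_crossover(a1, b1, 5, 10)
print(a1, b1)
print(a2, b2)
```

After two rounds each placeholder gene is a mosaic with several junctions, which is the kind of pattern the sequencing described in this example looks for.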

After the second round of shuffling, plasmid preparations
were made from 20 different clones. Competent PL1801 was
transformed with each of the 20 plasmid preps and plated on LBPG
supplemented with erythromycin. The plates were replicated onto
LBPG plates with chloramphenicol and erythromycin to identify
unwanted clones where both pSX120 and pMB430 had been
transformed. Clones where only the pMB430 plasmid had been
transformed were selected for sequencing of the Savizyn region to
determine whether shuffling had taken place. It is evident from
figure 6 that recombination between the two genes Savinase and
Savizyn had taken place. All of the 12 sequenced clones show some
degree of recombination, and 4 of these can positively be
identified as being the result of two consecutive double
recombination events. It is important to note that exchange of
Savinase and Savizyn sequence can only be observed if the shuffled
fragment overlaps a restriction site that is unique for Savizyn.
The four clones with positive evidence for two double
recombinations are therefore the minimum number, since the
recombination could have occurred between two neighbouring
restriction sites.
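
The detectability argument above can be stated as a simple interval test: an exchanged stretch is visible only if it contains at least one marker site. The following is a minimal sketch under that assumption; the marker positions are invented for illustration and do not correspond to the actual Savizyn restriction sites.

```python
def exchange_is_detectable(segment_start, segment_end, marker_positions):
    """Return True if any marker site lies inside [segment_start, segment_end)."""
    return any(segment_start <= pos < segment_end for pos in marker_positions)


# Hypothetical positions of restriction sites unique to one of the two genes.
markers = [100, 250, 400]

# An exchange that falls entirely between two neighbouring markers is invisible.
hidden = exchange_is_detectable(120, 240, markers)
# An exchange that spans a marker is visible.
visible = exchange_is_detectable(90, 110, markers)
print(hidden, visible)
```

This is why the count of clones with clear evidence of two double recombinations is a lower bound: exchanges of the first kind leave no trace in the marker pattern.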

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5313 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Plasmid pMB430 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

GA7CAG7ACA AGAAAGATAC TGTATTTCAT AAACAGCAAC 7CCAAGAACT TAAGGATGAG 60 

TTACAGAAGG CAAATAAGCA GTTACAGAGT GG AA7AC AGC ATATGAGGTC TACGAAACCC 120 

TTTGATTATG AAAA7CAGCG 7ACAGC777G 77C7C7CGAC GTGAAGAGAC 7CGTACAAAC 180 

A7A77AAC7G C7GA7GAA77 7GAACCCC7G CAACAAAC.^A 7C7CTTCTGC AGAACCGA77 240 

G77GA7CA77 ACGAAAA7A7 TAACAGCACA GAC7A77ACA CAGAAAATCA AGAA7TAAAA 300 

AAACGTAGAG AGAG777GAA AGAAGTAG7G AA7ACA7GGA AAGAGGCGTA TCACGAAAAA 360 

AG7AAAGAGG T7AA7AAA77 AAAGCGAGAG AATGA7AG77 TGAA7GAGCA GT7GAA7G7A 4 20 

TCAGAGAAA7 TTCAAGC7AG TACAGTGAC7 77A7A7CG7G CTGCGAGGGC GAATTTCCC7 4 80 

GGGTT7GAGA AAGGG777AA TAGGCTTAAA GACAAA77C7 TTAATGATTC CAAATT7GAG 540 

CGTG7GGGAC AGTTTA7GGA 7GT7GTACAG GA7AA7G7CC AGAAGG7CGA TAGAAJVGCG7 600 

GAGAAACAGC GTACAGACGA TTTAGAGA7G TAGAGG7AC7 TTTATGCCGA GAAAACTT77 660 
TGCGTGTGAC AGTCCTTAAA ATATACTTAG AGCGTAAGCG AAAGTAGTAG CGACAGCTAT 720
TAACTTTCGG TTGCAAAGCT CTAGGATTTT TAATGGACGC AGCGCATCAC ACGCAAAAAG 780 
GAAATTGGAA TAAATGCGAA ATTTGAGATG TTAATTAAAG ACCTTTTTGA GGTCTTTTTT 840 
TCTTAGATTT TTGGGGTTAT TTAGGGGAGA AAACATAGGG GGGTACTACG ACCTCCCCCC 900 
TAGGTGTCCA TTGTCCATTG TCCAAACAAA TAAATAAATA TTGGGTTTTT AATGTTAAAA 960 

GGTTGTTTTT TATGTTAAAG TGAAAAAAAC AGATG7TGGC AGGTACAGTG ATGGTTGTAG 1020 

ATAGAAAAGA AGAGAAAAAA GTTGCTGTTA CTT7AAGAC7 TACAACAGAA GAAAA7GACA 1080 

TATTAAATAG AATCAAACAA AAATATAATA TTACCAAA7C AGATCCAACC GGTATTCTAA 1140 

TAAAAAAATA TGCAAAGGAG GAATACGGTG CA77T7AAAC AAAAAAAGAT AGACAGCACT 1200 

GCCA7CC7CC CTATCTATGA CTAAATTTTG T7AAG7G7 A7 7AGCACCG77 ATTA7A7CA7 12 60 

GAGCGAAAAT G7AATAAAAG AAAC7CAAAA CAAGAAAAA7 TCAAGACGAC CTAA77GGAC 1320 

A77TG7777A TA7CCACAAT CACCAAAAGC CGAGTGGT7A GAGTAT7TAA AAGAG77ACA 13 80 

CA77CAAT7T G7AG7G7C7C CA77ACA7G A 7AGGGATACT GA7ACAGAAG G7AGGA7GAA 1440 

AAAAGAGCAT 7ATCATA77C TAGTCA7G7A TGAGGG7AA7 AAA7C7TA7G AACAGA7AAA 1500 

AA7AA7TAAC AGAAGAA77G AA7GCG ACTA 77CCGCAGA7 7GCAGGAAGT G7GAAAGG7C 1560 

TTGTGAGATA TATGCT7CAC A7GGACGA7C CTAA7AAATT 7AAATATCAA AAAGAAGATA 1620 
TGATAGTT7A TGGCGG7G7A GATG77GATG AA77A77AAA GAAAACAACA ACAGATAGAT 1680 
ATAAATTAA7 TAAAGAAATG AT7GAGT7TA T7GA7GAACA AGGAATCGTA GAATTTAAGA 1740 
GTTTAATGGA TTATGCAATG AAGT7TAAAT TTGA7GA7TG GT7CCCGCTT TTATGTGATA 1800 
ACTCGGCCTA TGTTATTCAA GAATATATAA AATCAAATCG GTATAAATCT GACCGATAGA 1860

TTTTGAATTT AGGTGTCACA AGACACTCTT TTTTCGCACC AGCGAAAACT GGTTTAAGCC 1920 

GACTGCGCAA AAGACGAATT CGAGCTCGGT ACCGGTAATG ACTCTCTAGC TTGAGGCATC 1980 

AAATAAAACG AAAGGCTCAG TCGAAAGACT GGGCCTTTCG TTTTATCTGT TGTTTGTCGG 2040 

TGAACGCTCT CCTGAGTAGG ACAAATCCGC CGCTCTAGCT AAGCAGAAGG CCATCCTGAC 2100 

GGATGGCCTT TTTGCGCCGC GGAGATCTAA ATATTCGTC3G TGGCGCAAGC TTTGTACCAG 2160 

CCGAACCG7C GACTCAAGAT GGGAATGGGC ATGCCACGCA TGTCGCCGGG ACGATCGCTC 2220 

C777AAACAA TTCGATTGGC GTTCTTGGCG TACCCCCCAC CGC7CACC7A 7ACGCTCTTA 2280 

AAGTCCTAGG GGCGACCGGT TCAGGTTCCG 7CAGC7CG AT TGCCCAAGCA TTGGAATGGG 2340 

CAGGGAACAA TGGCATGCAC GTTGCTAATT TGAGTTTAGG AAGCCCTTCG CCAAGTGCCA 2400 

CACTCGAGCA ACCTC77AAT AGCGCGACTT CTAGAGGCGT 7CTTG77GTA GCGGCATCTG 2460 

CCAA77CACG TGCAGGCTCA ATCAGCTATC CGGCGCGCTA TGCGAACGCA ATGGCAGTCG 2520 

GAG C7ACTG A TCAAAACAAC AACCGCGCTA GCTTTT CACA GTATCCCGCA GGCCTTGACA 2580 

TTC7CCCACC CGGGGTAAAC CTGCAGAGCA CATACCCAGG TTCAACA7A? GCCAGCTTAA 2 640 

ACGGTACATC GATGGCTACT CCTCATGTTG CAGG7GCGGC CGCCCTTGTT AAACAAAAGA 2700 

ACCCATCTTG GTCTAATGTA CAAATTCGAA ATCA7CTAAA GAA7ACGGCA ACTACTTTAG 27 60 

GAAGCACGAA CTTGTATGGA AGCGGAC77G T7AACGCAGA AGCGGCAACG CG7TAAAAAT 2820 

GAGGAGGGAA GCTTTATGAG TAAAGGAGAA GAAC77TTCA CTGGAGTTGT CCCAATTCTT 2880 

G77GAATTAG ATGGCGATGT TAATGGGCAA AAA7TCTCTG TTAG7GGAGA GGGTGAAGGT 2940 
GATGCAACAT ACGGAAAACT TACCCTTAAA TTTATTTGCA CTACTGGGAA GCTACCTGTT 3000

CCATGGCCAA CGCTTGTCAC TACTCTCTCT TATGGTGTTC AATGCTTTTC TAGATACCCA 3060 

GATCATATGA AACAGCATGA CTTTTTCAAG AGTGCCATGC CCGAAGGTTA TCTACAGGAA 3120 

AGAACTATAT TTTACAAAGA TGACGGGAAC TACAAGACAC GTGCTGAAGT CAACTTTCAA 3180 

GGTGATACCC TTGTTAATAG AATCGAGTTA AAACGTATTG ATTTTAAAGA AGATCGAAAC 3240 

ATTCTTGGAC ACAAAATGGA ATACAATTAT AAC7CACA7A ATGTATACAT CATGGCAGAC 3300 

AAACCAAAGA ATGGCATCAA AGTTAACTTC AAAATTAGAC ACAACATTAA AGATGGAAGC 3360 

GTTCAATTAG CAGACCATTA TCAACAAAAT ACTCCAAT7G GCGATGGCCC TCTCCTTTTA 34 20 

CCAGACAACC ATTACCTCTC CACGCAATCT GCCC777CCA AAGATCCCAA CGAAAAGAGA 3480 

GATCACATCA TCCTTCTTGA C777G7AACA GCTCCTGGG A TTACACATGC CATGGATGAA 3540 

CTATACAAAT AAAACAAGTT AAGGGA7CCA G77TA7GCA7 CCCTTAACTT ACTTATTAAA 3 600 

TAA777A7AG CTATTGAAAA GAGA7AAGAA TTGTTCAAAG CTAATATTGT TTAAATCGTC 3 660 

AATTCCTGCA TGTTTTAAGG AA7TG77AAA 77GA777777 G7AAATA777 7C77GTAT7C 3720 

T77GT7AACC CA777CA7AA CGAAA7AA77 A7AC7777G7 77A7C7T7G7 G7GA7A7TC7 3780 

TGA777777T C7AC77AA7C 7GATAAG7GA GC7A77CAC7 77AGGTT7AG GA7GAAAATA 3840 

TTC7C77GGA ACCA7AC77A A7ATAGAAA7 ATCAAC7TC7 GCCATTAAAA G7AA7GCCAA 3900 

7GAGCG7T77 GTAT77AA7A A7CT777AGC AAACCCG7A7 TCCACGA77A AATAAA7C7C 3960 

ATTAGCTATA CTATCAAAAA CAA7777GCG 7A77A7A7CC G7ACTTA7G7 TATAAGGTA7 4020 

ATTACCATAT ATTTTATAGG AT7GG7777T AGGAAA777A AAC7GCAATA TATCCTTGT7 4080 
TAAAACTTGG AAATTATCGT GATCAACAAG TTTATTTTCT GTAGTTTTGC ATAATTTATG 4140

GTCTATTTCA ATGGCAGTTA CGAAATTACA CCTCTTTACT AATTCAAGGG TAAAATGGCC 4200 

TTTTCCTGAG CCGATTTCAA AGATATTATC ATGTTCATTT AATCTTATAT TTGTCATTAT 4260 

TTTATCTATA TTATGTTTTG AAGTAATAAA CTTTTGACTG TGTTTTATAT TTTTCTCGTT 4320 

CATTATAACC CTCTTTAATT TGGTTATATG AATTTTGCTT ATTAACGATT CATTATAACC 4380 

ACTTATTTTT TGTTTGGTTG ATAATGAACT CTCCTGATTA CAAAAATACT AAAAATGCCC 4440 

ATATTTTTTC CTCCTTATAA AATTAG7ATA ATTATAGC«C CACCTCTGAT AAA7ATGAAC 4 500 

A7GA7GAGTG ATCGTTAAAT TTATACTCCA ATCGGA7GCG ATTATTGAAT AAAAGATATG 4 5 60 

AGAGATTTAT CTAATTTCTT TTTTCTTGTA AAAAAAGAAA GTTCTTAAAG GTTTTATAGT 4 520 

TTTGGTCGTA GAGCACACGG TTTAACGACT TAATTACGAA GTAAATAAGT CTACTGTGT7 4 680 

AGACTTTATG AAATCTATAT ACGTTTATAT ATATTTATTA TCCGGAGGTG TACCATGTCT 4740 

CATTCAATTT TGAGGGTTGC CACAC77AAA GGATCAAGTA ATACAAACGG GATACAAAGA 4800 

CATAATCAAA GAGAGAATAA AAAC7A7AA7 AA7AAAGACA TAAATCATGA GGAAACATAT 4 860 

AAAAA77ATG AT77GA77AA CGCACAAAA7 A7AAAGTA7A AAGATAAAA7 TGATGAAACG 4920 

ATTGA7GAGA AT7ATTCAGG GAAACG7AAA A7TCGGTCAG A7GCAA7TCG ACATGTGGAC 4980 

GGAC7GCT7A CAAGTGA7AA AGA777CTT7 GA7GA777AA GCGGAGAAGA AA7AGAACGA 5040 

TTTTTTAAAG ATAGCTTGGA G7T7CTAGAA AA7GAATACG GTAAGGAAAA TATGC7GTAT 5100 

GCGACTGTCC A7CTGGATGA AAGAGTCCCA CATATGCACT TTGGTT7TGT CCCTTTAACA 5160 

GAGGACGGGA GATTGTCTGC AAAAGAACAG T7AGGCAACA AGAAAGACTT TACTCAATTA 5220 
CAAGATAGAT TTAATGAGTA TGTGAATGAG AAAGGTTATG AACTTGAAAG AGGCACGTCC 5280
AAAGAGGTTA CAGAACGAGA ACATAAAGCG ATG 5313 
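
Each line of the listing above consists of blocks of ten bases followed by the running position of the last base on that line. A minimal sketch of how such lines might be parsed and cross-checked against their printed positions is given below. The helper names are hypothetical, and the rule of joining trailing digit groups (so that a split number such as "4 20" reads as 420) is an assumption about the extraction damage, not part of the listing format.

```python
def parse_listing_line(line):
    """Split one listing line into (bases, printed_end_position).

    Trailing all-digit tokens are joined to form the position number;
    the remaining tokens are concatenated into the base string.
    """
    tokens = line.split()
    digits = []
    while tokens and tokens[-1].isdigit():
        digits.insert(0, tokens.pop())
    position = int("".join(digits)) if digits else None
    return "".join(tokens), position


def counts_are_consistent(previous, current):
    """Check that the position increase equals the number of bases on the line."""
    _, prev_pos = previous
    bases, pos = current
    return pos - prev_pos == len(bases)


# The last two lines of the listing, as printed above.
LINES = [
    "CAAGATAGAT TTAATGAGTA TGTGAATGAG AAAGGTTATG AACTTGAAAG AGGCACGTCC 5280",
    "AAAGAGGTTA CAGAACGAGA ACATAAAGCG ATG 5313",
]

parsed = [parse_listing_line(line) for line in LINES]
consistent = counts_are_consistent(parsed[0], parsed[1])
print(parsed[1], consistent)
```

A check of this kind only confirms that base counts agree with the printed positions; it cannot restore bases that were misread during extraction.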

Claims

1. A process for in vivo recombination of homologous DNA
sequences, comprising the following steps

a) incubating a cell containing at least two DNA structures,
each comprising a DNA sequence and an origin of replication,
under conditions which favour integration of one of the
DNA-structures into one of the others, thereby forming a
hybrid DNA structure;

b) incubating the cell under conditions which favour crossing
out from said hybrid DNA structure, thereby forming novel DNA
structures, each comprising a recombined DNA sequence and an
origin of replication;

c) repeating steps a)-b) at least once.

2. A process for in vivo recombination of homologous DNA
sequences, comprising the steps

a) incubating a cell containing at least two DNA structures,
each comprising a DNA sequence and an origin of replication,
under conditions where i) at least one origin of replication is
non-functional and ii) the cell is under a selective pressure
which only allows growth of the cell if it contains the DNA
structure which is having the non-functional origin (i.e.
favoring an integration of one of the DNA-structures into one of
the others, thereby forming a hybrid DNA structure);

b) changing of said conditions to conditions where fewer
origins of replication are non-functional than in step a) (i.e.
favoring a crossing out from said hybrid DNA structure);

c) repeating steps a)-b) at least once.

3. The process according to claim 1-2, characterized in that
at least one DNA structure is a chromosome, preferably a
bacterial chromosome, more preferably a Bacillus chromosome,
especially a Bacillus subtilis chromosome.

4. The process according to claim 1-3, characterized in that
at least one DNA structure is a vector, preferably a bacterial
vector, more preferably a vector able to replicate in a Bacillus
cell, especially a Bacillus subtilis cell.

5. The process according to claim 4, wherein at least two of 
the DNA constructs are two different vectors. 

6. A process for in vivo recombination of homologous DNA
sequences, comprising the steps

a) inserting the homologous DNA sequences in at least two
different vectors, said vectors containing different origins
of replication;

b) transferring the vectors into a cell;

c) incubating the cell under conditions where i) at least one
origin of replication is non-functional and ii) the cell
is under a selective pressure which only allows growth of
the cell if it contains the DNA structure which is having
the non-functional origin (i.e. favoring an integration of
one of the DNA-structures into one of the others, thereby
forming a hybrid DNA structure);

d) changing of said conditions to conditions where fewer
origins of replication are non-functional than in step c)
(i.e. favoring a crossing out from said hybrid DNA
structure);

e) repeating steps c)-d) as many times as wanted, however at
least once.

7. The process according to claim 1-6, characterised in that
at least one origin of replication is selected in such a way
that the replication rate of the vector can be regulated
chemically or physically.

8. The process according to claim 7, characterised in that at
least one origin of replication is temperature sensitive.

9. The process according to claim 8, characterised in that the
replication rate of at least one vector can be regulated by a 
temperature shift. 

10. The process according to claim 1-9, characterised in that
the vectors contain genes for markers, preferably selectable
markers, especially antibiotic markers.

11. The process according to claim 10, characterised in that the
vectors contain genes for different markers.

12. The process according to any of the preceding claims,
wherein at least one DNA structure does not comprise Ori(-) (i.e.
only comprises Ori(+)).

13. The process according to claim 12, wherein the DNA structure
which does not comprise Ori(-) is a DNA structure whose origin
(i.e. herein Ori(+)) is functional during the hybrid DNA
structure formation performed according to a process according to
any of the preceding claims.

14. The process according to any of the preceding claims,
wherein formation of the hybrid structure is measured in
vivo, preferably by having a system of DNA structures, wherein
an in vivo screenable protein is expressed only after the hybrid
structure has been formed.

15. The process according to claim 14, wherein said screenable
protein is a protein which is fluorescent under suitable
exposure, and in particular said fluorescent protein is Green
Fluorescent Protein (GFP) or variants thereof.

16. The process according to any of the preceding claims,
wherein crossing out from the hybrid DNA structure is a forced
crossing out event.



17. The process according to claim 16, wherein the forced
crossing out from the hybrid structure is performed by:

i) constructing a system wherein the DNA hybrid structure
does not express a selectable marker, and at least one of
the at least two DNA structures according to any of the
preceding claims is able to express said selectable marker;

ii) incubating the cell under selective conditions,
thereby forcing a cross out of the DNA hybrid structure,
since the cell is only able to express the marker gene after
the crossing out of the DNA hybrid structure has taken place.

18. The process according to claim 17, wherein the selectable
marker in step i) is expressed on a DNA structure where the
origin of replication is non-functional during a forced
integration of the DNA structures into a DNA hybrid structure
performed as described above, and preferably the selectable
marker gene is an antibiotic resistance gene.

19. The process according to any of the preceding claims,
wherein the homologous DNA sequences comprise DNA sequences from
at least one pre-made DNA library.



20. The process according to any of the preceding claims,
wherein the homologous DNA sequences are comprised of two or
more different sets of homologous DNA sequences ("set A", "set B",
"set C" etc.), and wherein each of said sets of homologous
sequences ("set A", "set B", "set C" etc.) is inserted in a
different DNA construct ("DNA construct A", "DNA construct B",
"DNA construct C" etc.) according to any of the preceding claims.

21. The process according to claim 20, wherein at least one of
said sets of homologous DNA sequences is derived from a pre-made
DNA library, and at least one of said sets of homologous DNA
sequences is a DNA sequence which is homologous to said DNA
library (i.e. resulting in a process for in vivo recombination of
said DNA library to a DNA sequence which is homologous to said
DNA library).

22. The process according to claim 20, wherein said sets of
homologous DNA sequences are derived from the same pre-made DNA
library (i.e. resulting in a process for in vivo recombination of
said DNA library).

23. The process according to claim 20, wherein said sets of
homologous DNA sequences are derived from at least two different
pre-made DNA libraries (i.e. resulting in a process for in vivo
recombination of at least two different DNA libraries).

24. The process according to claim 20, wherein said sets of
homologous DNA sequences each contain only one DNA sequence (i.e.
recombination of two or more homologous sequences).

25. The process according to any of the preceding claims,
wherein two sets of homologous DNA sequences ("set A" and "set
B") are recombined in vivo, comprising the steps

a) inserting sequence(s) from "set A" in a vector containing
a temperature resistant origin of replication and a gene for
a first marker;

b) inserting sequence(s) from "set B" in a vector containing
a temperature sensitive origin of replication and a gene for
a second marker;

c) transferring the vectors into a cell;

d) incubating the cell at a temperature where both origins
of replication are functional, optionally under a selection
pressure that favours cells with both marker genes;

e) shifting the temperature to a temperature where only
the temperature resistant origin of replication is
functional;

f) repeating steps d)-e) at least once.
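
For illustration only, the alternation of steps d) and e) can be written out as a schedule. The sketch below is not part of the claim. The default temperatures are borrowed from Example 1 (30°C permissive, 47°C at the lower end of the restrictive range), whereas the claim itself fixes no temperatures, and the function name and tuple layout are hypothetical.

```python
def temperature_schedule(rounds, permissive_c=30, restrictive_c=47):
    """Return a list of (step, temperature_c, functional_origins) tuples.

    `rounds` counts how many times steps d)-e) are carried out in total;
    since the steps are repeated at least once, at least 2 rounds are needed.
    """
    if rounds < 2:
        raise ValueError("steps d)-e) must be repeated at least once")
    both = ("temperature-resistant", "temperature-sensitive")
    resistant_only = ("temperature-resistant",)
    schedule = []
    for _ in range(rounds):
        schedule.append(("d", permissive_c, both))              # both origins functional
        schedule.append(("e", restrictive_c, resistant_only))   # only one origin functional
    return schedule


schedule = temperature_schedule(2)
for step in schedule:
    print(step)
```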



26. The process according to any of the preceding claims,
wherein two sets of homologous DNA sequences ("set A" and "set
B") are recombined in vivo, comprising the steps

a) inserting sequence(s) from "set A" in a vector containing
a temperature resistant origin of replication and a gene for
a first marker;

b) inserting sequence(s) from "set B" in a vector containing
a temperature sensitive origin of replication and a gene for
a second marker;

c) transferring the vectors into a cell;

d) incubating the cell at a temperature where only the
temperature resistant origin of replication is functional,
optionally under a selection pressure that favours cells
with both marker genes;

e) optionally, shifting the temperature to a temperature
where both origins of replication are functional;

f) repeating steps d)-e) at least once.

27. The process according to claim 1-26, characterised in that
the cell containing the DNA structures is defective or
transitorily inactivated in the enzymatic mismatch repair system
or has a reduced level of mismatch repair.

28. The process according to claim 1-27, characterised in that
the cell is a Gram positive cell, such as a Staphylococcus cell,
a Streptococcus cell, a Bacillus cell, or in particular a
Bacillus subtilis cell.

29. The process according to claim 1-28, characterised in that
the cell is a Gram negative cell, preferably an Escherichia cell,
especially an E. coli cell.

30. The process according to claim 1-29, characterised in that
the homologous DNA sequences encode polypeptides, especially
enzymes, and in particular enzymes such as an amylase, lipase,
cutinase, cellulase, oxidase, phytase, and a protease.

31. A method for producing one or more recombinant protein(s)
having a desired biological activity comprising:

a) performing a process for in vivo recombination of
homologous DNA sequences according to any of the preceding
claims, wherein at least one of the DNA structure(s) is an
expression vector;

b) expressing the numerous different recombinant polypeptides
encoded by the numerous different recombined DNA sequences
from step a); and

c) screening or selecting the numerous different recombinant
proteins from step b) in a suitable screening or selection
system for one or more recombinant polypeptide(s) having
a desired activity.

32. A set of at least two vectors, where at least two vectors
comprise at least two individually different origins of
replication able to function in the same cell under permissive
conditions, and two homologous DNA sequences encoding
polypeptides, said DNA sequences having more than 50% homology,
preferably more than 70%.
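
As a rough illustration of a percentage threshold of this kind, the sketch below computes simple percent identity between two sequences that are assumed to be already aligned and of equal length. This is an assumption made for the example only: the claim does not state how homology is to be calculated, and the function name and test sequences are hypothetical.

```python
def percent_identity(seq_a, seq_b):
    """Percent of positions that are identical in two pre-aligned sequences."""
    if not seq_a or len(seq_a) != len(seq_b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(seq_a, seq_b))
    return 100.0 * matches / len(seq_a)


# Two invented 10-base sequences differing at two positions.
identity = percent_identity("ACGTACGTAC", "ACGTTCGTAA")
print(identity, identity > 50, identity > 70)
```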

33. A vector system, comprising at least 

a) a vector A containing a DNA sequence encoding a polypep- 
tide and an origin of replication 

b) a vector B containing a DNA sequence homologous to the 
DNA sequence in vector A, and an origin of replication 
that differs from the origin of replication in vector A. 

34. The vector system according to claim 33, characterised in
that the DNA sequences encode polypeptides with a biological
activity, especially enzymes.

35. A cell containing a vector system according to claim 33-34. 

36. The cell according to claim 35, characterised in that the
cell is a Gram positive cell, preferably a Bacillus cell,
especially a Bacillus subtilis cell.

37. The cell according to claim 35, characterised in that the
cell is a Gram negative cell, preferably an Escherichia cell,
especially an E. coli cell.

38. A recombined DNA sequence, characterised in that it is a
result of a process according to one or more of the claims 1-30,
or it is encoding a recombinant protein having a desired
biological activity, wherein said recombinant protein having a
desired biological activity is produced by the method according
to claim 31.

39. A process for producing a polypeptide, especially an en- 
zyme, comprising i) growing a cell expressing a polypeptide en- 
coded by a recombined DNA sequence according to claim 38, and 
ii) purifying said expressed polypeptide. 


40. A polypeptide, especially an enzyme, characterised in that 
it is produced by a process according to claim 39. 

41. A polypeptide, especially an enzyme, characterised in that
it is encoded by the recombined DNA sequence according to claim
38.